UTH-CCB@BioCreative V CDR Task: Identifying Chemical-induced Disease Relations in Biomedical Text

نویسندگان

  • Jun Xu
  • Yonghui Wu
  • Yaoyun Zhang
  • Jingqi Wang
  • Ruiling Liu
  • Qiang Wei
  • Hua Xu
چکیده

This paper describes the system developed by the UTH-CCB team from the University of Texas Health Science Center at Houston (UTHealth), for the 2015 BioCreative V shared tasks of Track 3 on extraction of chemical disease relation (CDR). We participated in both tasks: Task A for “Disease Named Entity Recognition and Normalization (DNER)” and Task B for “Chemical-induced Diseases Relation Extraction (CID)”. For Task A, we developed a Conditional Random Fields based named entity recognition system and used a general Vector Space Model-based approach for entity normalization. To extract the chemical-induced disease relation, we combined two Support Vector Machines-based classifiers, which were trained on sentenceand documentlevel, respectively. Our system achieved a F1 score of 83.53 for Task A and 57.03 for Task B, demonstrating the effectiveness of machine learning-based approaches for automatic extraction of entities and their relations from biomedical literature.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

CD-REST: a system for extracting chemical-induced disease relation in literature

Mining chemical-induced disease relations embedded in the vast biomedical literature could facilitate a wide range of computational biomedical applications, such as pharmacovigilance. The BioCreative V organized a Chemical Disease Relation (CDR) Track regarding chemical-induced disease relation extraction from biomedical literature in 2015. We participated in all subtasks of this challenge. In ...

متن کامل

Assessing the state of the art in biomedical relation extraction: overview of the BioCreative V chemical-disease relation (CDR) task

Manually curating chemicals, diseases and their relationships is significantly important to biomedical research, but it is plagued by its high cost and the rapid growth of the biomedical literature. In recent years, there has been a growing interest in developing computational approaches for automatic chemical-disease relation (CDR) extraction. Despite these attempts, the lack of a comprehensiv...

متن کامل

Chemical-induced disease relation extraction via convolutional neural network

This article describes our work on the BioCreative-V chemical-disease relation (CDR) extraction task, which employed a maximum entropy (ME) model and a convolutional neural network model for relation extraction at inter- and intra-sentence level, respectively. In our work, relation extraction between entity concepts in documents was simplified to relation extraction between entity mentions. We ...

متن کامل

Integrating Word Sequences and Dependency Structures for Chemical-Disease Relation Extraction

Understanding chemical-disease relations (CDR) from biomedical literature is important for biomedical research and chemical discovery. This paper uses a k-max pooling convolutional neural network (CNN) to exploit word sequences and dependency structures for CDR extraction. Furthermore, an effective weighted context method is proposed to capture semantic information of word sequences. Our system...

متن کامل

An enhanced CRF-based system for disease name entity recognition and normalization on BioCreative V DNER Task

Disease plays a central role in many areas of biomedical research and healthcare. However, the rapid growth of disease and treatment research creates barriers to the knowledge aggregation of PubMed database. Thus, a framework of disease mention recognition and normalization has become increasingly important for biomedical text mining. In this work, we utilize conditional random fields (CRFs) to...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2015